Where Do Se-trees Perform? (part I)

نویسنده

  • Ron Rymon
چکیده

As a classiier, a Set Enumeration (SE) tree can be viewed as a generalization of decision trees. We empirically characterize domains in which SE-trees are particularly advantageous relative to decision trees. Speciically, we show that: 1. SE-trees excel in domains in which relatively few examples are available; and 2. SE-trees excel in noisy domains. In noisy domains, we discover that SE-trees perform more consistently (measured by the variance in error) in one part of the spectrum, and less consistently in the other; in the lack of noise, we nd that SE-trees are almost invariably more consistent than their decision tree counterparts. Finally, we develop a simple complexity measure based on a target function's syntactic form, and show that SE-trees enjoy a particular advantage in more complex domains.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mineral Chemistry and metamorphic evolution of the Late Neoproterozoic metabasites of Do-Chah metamorphic - igneous complex (SE Shahrood)

Metapelites of the Do Chah complex (SE Shahrood) are composed of micaschist, garnet micaschist, chloritoid schist and garnet-bearing gneiss. In the highest degree of metamorphism, metapelites have been affected by partial melting, resulting as granitization. A significant part of these rocks, imposed by compressional tectonic regime and show typical evidence of plastic deformation and intensive...

متن کامل

Counting the number of spanning trees of graphs

A spanning tree of graph G is a spanning subgraph of G that is a tree. In this paper, we focus our attention on (n,m) graphs, where m = n, n + 1, n + 2, n+3 and n + 4. We also determine some coefficients of the Laplacian characteristic polynomial of fullerene graphs.

متن کامل

Classification trees as an alternative to linear discriminant analysis.

Linear discriminant analysis (LDA) is frequently used for classification/prediction problems in physical anthropology, but it is unusual to find examples where researchers consider the statistical limitations and assumptions required for this technique. In these instances, it is difficult to know whether the predictions are reliable. This paper considers a nonparametric alternative to predictiv...

متن کامل

CannyFS: Opportunistically Maximizing I/O Throughput Exploiting the Transactional Nature of Batch-Mode Data Processing

We introduce a user mode file system, CannyFS, that hides latency by assuming all I/O operations will succeed. The user mode process will in turn report errors, allowing proper cleanup and a repeated attempt to take place. We demonstrate benefits for the model tasks of extracting archives and removing directory trees in a real-life HPC environment, giving typical reductions in time use of over ...

متن کامل

Modular Semi-automatic Formal Verification of Critical Systems Software ; Modulaire halfautomatische formele verificatie van kritische systeemsoftware

In the first part of this thesis, we present a case study on successfully verifying the Linux USB BP keyboard driver. Our verification approach is (a) sound, (b) takes into account dynamic memory allocation, complex API rules and concurrency, and (c) is applied on a real kernel driver which was not written with verification in mind. We employ VeriFast, a software verifier based on separation lo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007